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ABSTRACT 

Peptide nucleic acids (PNAs) have been developed 
for applications in biotechnology and therapeutics. 
There is great potential in the development of 
chemically modified PNAs or other triplex-forming 
ligands that selectively bind to RNA duplexes, but 
not single-stranded regions, at near-physiological 
conditions. Here, we report on a convenient 
synthesis route to a modified PNA monomer, thio- 
pseudoisocytosine (L), and binding studies of 
PNAs incorporating the monomer L. Thermal 
melting and gel electrophoresis studies reveal that 
L-incorporated 8-mer PNAs have superior affinity 
and specificity in recognizing the duplex region 
of a model RNA hairpin to form a pyrimidine motif 
major-groove RNA 2 -PNA triplex, without appre- 
ciable binding to single-stranded regions to form 
an RNA-PNA duplex or, via strand invasion, 
forming an RNA-PNA 2 triplex at near-physiological 
buffer condition. In addition, an L-incorporated 
8-mer PNA shows essentially no binding to single- 
stranded or double-stranded DNA. Furthermore, an 
L-modified 6-mer PNA, but not pseudoisocytosine 
(J) modified or unmodified PNA, binds to the HIV-1 
programmed -1 ribosomal frameshift stimulatory 
RNA hairpin at near-physiological buffer conditions. 
The stabilization of an RNA 2 -PNA triplex by L modi- 
fication is facilitated by enhanced van der Waals 
contacts, base stacking, hydrogen bonding and 
reduced dehydration energy. The destabilization 
of RNA-PNA and DNA-PNA duplexes by L modifica- 
tion is due to the steric clash and loss of two 
hydrogen bonds in a Watson-Crick-like G-L pair. 
An RNA 2 -PNA triplex is significantly more stable 
than a DNA 2 -PNA triplex, probably because the 



RNA duplex major groove provides geometry 
compatibility and favorable backbone-backbone 
interactions with PNA. Thus, L-modified triplex- 
forming PNAs may be utilized for sequence- 
specifically targeting duplex regions in RNAs for 
biological and therapeutic applications. 

INTRODUCTION 

RNAs have an expanding list of biological functions 
including coding proteins, catalysis, gene regulation and 
immunomodulation (1-10). The functions of many RNAs 
are determined by the diverse structures they may form. 
RNA secondary structures are comprised of both single- 
stranded loop and double-stranded stem regions. RNA 
tertiary structures involve interactions between the 
secondary structure building blocks: for example, loop- 
stem, loop-loop and stem-stem interactions. RNA struc- 
tures and functions are further diversified upon binding 
to proteins to form functional ribonucleoproteins such 
as snRNPs (11) and to other molecules and ions (as is 
the case in metabolite-sensing riboswitches) (1,7,8,10). 
Antisense therapeutics and microarray technologies 
(12-15) involve sequence-specific binding of oligonucleo- 
tides to single-stranded regions of target RNAs and 
have both had major impacts on biology and advancing 
RNA-based therapeutics. In contrast, there are no widely 
applied methods for targeting double-stranded regions of 
RNA; thus there is great potential for developing methods 
that can target double-stranded regions of RNA, which 
make up the majority of nucleotides in many functional 
RNAs (16,17). 

DNA and RNA duplexes are recognized by triplex- 
forming oligonucleotides (TFOs) through sequence- 
specific hydrogen bonding and base stacking interactions 
(18-26). Thus, TFOs have a great potential in biotechnol- 
ogy and therapeutics. However, formation of major 
groove C + *G-C base triples is favored at relatively low 
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pH (<6.0) due to the fact that cytosine TV 3 positions 
(p^a = 4.5 for a C monomer) in TFOs need to be pro- 
tonated to form stable C +- G Hoogsteen pairs 
(Figure 1A). Numerous studies have been reported on 
enhancing DNA triplex stability at near neutral pH 
using chemically modified TFOs (18-20). For example, 
TFOs incorporating a neutral base pseudoisocytosine (J) 
(see Figure 1 B for the nucleobase structure) show minimal 
pH dependence in DNA 2 -TFO triplex formation (19,28). 

Peptide nucleic acids (PNAs), where the negatively 
charged sugar-phosphate backbone of natural nucleic 
acids is replaced by a neutral pseudopeptide backbone 
(see Figure 1 for the PNA backbone structure), show sig- 
nificantly enhanced (compared to unmodified DNA or 
RNA) binding affinity towards DNA duplexes (29-31). 
However, PNAs may also bind tightly to single-stranded 
DNAs (ssDNAs) in both parallel and anti-parallel orien- 
tations. Thus, strand invasion may occur, resulting in for- 
mation of DNA-PNA duplexes, DNA-PNA 2 triplexes 
and other structures, instead of DNA 2 -PNA triplexes 
(29-37). PNAs incorporated with J monomers 



(Figure IB and E) show minimal pH dependence in 
forming DNA-PNA 2 triplexes (35). 

A number of reports are focused on targeting RNA 
duplexes by RNA 2 -TFO triplex formation (21-24,38). 
Recently, RNA 2 -PNA triplex formation with minimal 
strand invasion was reported for unmodified and 
modified PNAs (39^13). It remains unclear whether it 
is possible to develop chemically modified PNAs or other 
triplex-forming ligands that bind to RNA duplex but not 
single-stranded regions at near-physiological condition. 

A DNA TFO containing a ribonucleoside thio- 
pseudoisocytosine (see Figure 1C for the nucleobase 
structure) was found to enhance the formation of a 
DNA 2 DNA triplex (44). We reason that thio- 
pseudoisocytosine has steric repulsion with G in a 
Watson-Crick-like pair (Figure IF and I), but has 
enhanced van der Waals interactions with G in a 
Hoogsteen-like pair (Figure 1C and H). Thus, incorpor- 
ation of thio-pseudoisocytosine monomers into PNAs 
may stabilize the RNA 2 -PNA triplex formation, 
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Figure 1. Chemical structures and structural models of base triples and base pairs formed between PNA (blue) and RNA (black). H 3 and S 4 atoms 
in bases C + , J and L are shown in red. (a-d) Chemical structures of base triples of C +- G-C, J*G-C, L-G-C and T-A-U. L has an enhanced van der 
Waals interaction with G in a Hoogsteen-like L-G pair, (e-f) Chemical structures of Watson-Crick-like G-J and G-L pairs, (g-i) Structural models 
of base triples J-G-C, L-G-C and Watson-Crick-like G-L pair. The three dimensional coordinates are based on a C +, G-C base triple from (26), 
assuming the structures do not change upon chemical modifications of base C. The numbers shown are inter-atomic distances in A. van der Waals 
radii of N, O, S atom groups are ~1.6, 1.5 and 1.8 A, respectively (27). A steric clash occurs between G and L in a Watson-Crick-like G-L pair, if 
the base pairing interface of the G-L pair is maintained the same shape as that of a Watson-Crick G-C pair. 
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destabilize the RNA-PNA duplex formation, and 
minimize the pH and salt dependence of RNA 2 -PNA 
triplex formation. 

Here, we report on the synthesis and binding studies 
of PNAs incorporated with the modified PNA monomer, 
thio-pseudoisocytosine (L, Figure 1C and F). Thermal 
melting and gel electrophoresis studies were carried out 
to (i) determine the molecular determinants of and envir- 
onment factors affecting parallel pyrimidine motif RN A 2 - 
PNA triplex stability, (ii) characterize the binding of PNAs 
to a DNA duplex and ssRNA and ssDNA and (hi) test 
the application of modified PNAs in targeting an HIV-1 
ribosomal frameshift inducing RNA structure (45,46). 

MATERIALS AND METHODS 

General methods and synthesis of PNA monomer L 

All anhydrous solvents were obtained from commercial 
sources. Reagents were used as received from commercial 
sources, without any further purification. Unless otherwise 
noted, commercial HPLC grade solvents and room tem- 
perature were used for all reactions. Thin layer chromatog- 
raphy (TLC) was performed to monitor reaction progress 
with aluminum sheets silica gel 60 F254 (Merck). Flash 
silica gel 23(M100 mesh and ethyl acetate/petroleum ether 
mixture were used as eluting solvent for column chroma- 
tography. All 'H and 13 C NMR spectra were acquired on 
a 300 MHz 'H (75 MHz, 13 C) spectrometer. The mass of 
all compounds was characterized by high resolution 
mass spectrometry (electron ionization) [HRMS (EI)]. 
Chemically synthesized and reversed phase-high-perform- 
ance liquid chromatography (RP-HPLC) purified RNA 
and DNA oligonucleotides were purchased from Sigma- 
Aldrich in Singapore. The detailed procedure for the 
synthesis of the modified PNA monomer L is shown in 
the Supplementary Material. 

Solid phase synthesis, purification and matrix-assisted 
laser desorption/ionization-time of flight analysis of PNA 
oligomers 

The N-(2-aminoethyl)glycine PNA (aegPNA) monomers 
were purchased from ASM Research Chemicals. PNA 
monomer J was synthesized following the reported 
method (35,47). Synthesis of PNA oligomers was carried 
out on 4-methylbenzhydrylamine hydrochloride (MBHA* 
HC1) polystyrene based resin. The original loading value 
1.5-1.7 mmol/g of this solid support was reduced to 
0.35mmol/g, using acetic anhydride as the capping 
reagent. (Benzotriazol- 1 -yloxy)tripyrrolidinophospho 
nium hexafluorophosphate (PyBop) and N,N- 
Diisopropylethylamine (DIPEA) were used as the 
coupling reagent and Boc strategy was followed during 
oligomer synthesis. After sequential deprotection of t-Boc 
group and coupling of aeg/modified PNA monomers on 
solid support, final cleavage of the oligomers were done 
by using 'high-low triffuoroacetic acid (TFA)-trifluoro- 
methanesulfonic acid (TFMSA)' method. Oligomers were 
then precipitated with diethyl ether, dissolved in water and 
purified by RP-HPLC method using water-CH 3 CN-0.1% 
TFA as the mobile phase. Sample crystallization matrix 



a-cyano-4-hydroxycinnamic acid (CHCA) was used in 
matrix-assisted laser desorption/ionization-time of flight 
(MALDI-TOF) to characterize the oligomers. 

Thermal melting 

UV absorbance versus temperature experiments were con- 
ducted using a Beckmann Coulter DU-800 spectrometer 
equipped with a Peltier temperature controller. 
Absorbance at 280 nm was recorded with increasing tem- 
perature from 15 to 95°C with a ramp rate of 0.5°C/min. 
The quartz cuvettes have an optical path length of 1 cm. 
Prior to each melting experiment, the RNA or DNA 
hairpin solution was heated at 95°C for 5 min and 
quickly cooled to ~0°C (snap cooling). Subsequently, 
PNA/oligonucleotide was added to the snap cooled RNA 
or DNA hairpin solution and annealed by heating at 65°C 
for 5 min and slowly cooling to room temperature followed 
by incubation at 0°C for 3-5 h. The solution was covered 
with silicon oil to prevent evaporation during the melting 
experiment. The final concentration of each strand is 5 uM. 
T^'s were determined from the Gaussian fits of the first- 
derivative of the normalized curves. The buffers for 
studying salt concentration dependence contain 0.5 mM 
ethylenediaminetetraacetic acid (EDTA), 20mM4-(2- 
hydroxyethyl)-l-piperazineethanesulfonic acid (HEPES), 
pH 7.5 with varying [NaCl] (10, 100, 200, 300, 500 and 
1000 mM). The buffers for studying pH dependence 
contain 200 mM NaCl, 0.5 mM EDTA, 20mM2-(A- 
morpholino)ethanesulfonic acid (MES) (pH 5.5 and 6.0) 
or HEPES (pH 6.5, 7.0, 7.5, 8.0, 8.5 and 9.0). We did not 
obtain equilibrium thermodynamic parameters from the 
thermal melting curves due to the hysteresis observed 
between heating and cooling curves. A ramp rate at 
0.2°C/min did not reduce hysteresis significantly. 

Non-denaturing polyacrylamide gel electrophoresis 

Non-denaturing polyacrylamide gel electrophoresis 
(PAGE) (12%) experiments were conducted with the 
sample incubation buffer containing 10 mM NaCl, 
0.5 mM EDTA, 20 mM MES (pH 5.5), or 200 mM 
NaCl, 0.5 mM EDTA, 20 mM HEPES (pH 7.0, 7.5 and 
8.0). The loading volume was 20 or 30 ul Samples were 
prepared by snap cooling of the hairpin followed by an- 
nealing with PNAs/oligonucleotides by slow cooling from 
65°C to room temperature followed by incubation at 4°C 
overnight. Thirty-five percent glycerol (20% of the total 
loaded volume) was added to the sample mixtures just 
before loading the samples into the wells. 1 x TBE (Tris- 
Borate-EDTA) buffer, pH 8.3 was used as the running 
buffer for all experiments. The gel was run at 4°C at a 
voltage 250 V for 6h. Gels were stained with ethidium 
bromide and imaged by a Typhoon phosphorimager. 

RESULTS AND DISCUSSION 

Synthesis of PNA monomer L and modified PNA 
oligomers 

A convenient route has been developed for the synthesis of 
PNA L monomer 6 protected at the N 2 exocyclic amine 
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and C 4 thiocarbonyl group (Scheme 1). To synthesize 
methyl ^-(benzyloxycarbonyfjisocytosin-S-ylacetate 1, 
we employed a previously reported method (35,47). The 
C 4 carbonyl of 1 was converted to its C 4 thiocarbonyl 
derivative 2 using Lawesson's reagent (48-50). The 
sulfur group of the thiocarbonyl compound 2 was pro- 
tected with 4-methoxybenzyl chloride in the presence of 
triethylamine to obtain the N 2 and S protected nucleobase 
methyl /V r2 -(benzyloxycarbonyl)-.v-(4-methoxybenzyl) 
isocytosin-5-ylacetate 3. The hydrolysis of 3 was accom- 
plished using aqueous lithium hydroxide (LiOH), and 
a solid compound, 7V 2 -(benzyloxycarbonyl)-5^- 
(4-methoxybenzyl)isocytosin-5-ylacetic acid 4, was 
obtained after acidification with 2M hydrochloric acid. 



Compound 5 was obtained by coupling of 4 with the 
PNA backbone, i.e. ethyl A^-(2-Boc-aminoethyl)glycinate 
(29,51) using l-Ethyl-3-(3-dimethylaminopropyl)car- 
bodiimide (EDQ/DIPEA as the coupling reagent. 
Further hydrolysis of 5 with aqueous LiOH followed by 
acidification with dowex cation exchange resin, yielded the 
final PNA monomer 6 in good yield. The protecting 
groups on jV 2 exocyclic amine and C 4 thiocarbonyl 
group of monomer 6 were liberated under strong acidic 
conditions during the cleavage of PNA oligomers from 
resin after completion of solid phase synthesis. 'H and 
13 C NMR were used to characterize the compounds 
(Supplementary Figures S1-S6). A series of 8- and 6-mer 
PNAs with various modifications (Table 1 , Supplementary 
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Scheme 1. Reagents and conditions for PNA L monomer synthesis: (A) Lawesson's reagent, THF, 0°C-rt, overnight, 54%. (B) 4-methoxybenzyl 
chloride, DCM, 1 h, 72%. (C) 1 M aqLiOH, THF, 1 h, 2M HC1, 78%. (D) Ethyl Af-(2-Boc-aminoethyl)glycinate, EDC-HC1, DIPEA, DMF, 4h, 
0°C-rt, 58%. (E) 1M aq LiOH, THF, 1 h, 70%. 



Table 1. Thermal stability comparison of triplexes (RNA 2 -PNA and DNA 2 -PNA) and duplexes (RNA-PNA and DNA-PNA) 



PNA or Sequences" Binding to rHPl or dHP Binding to ssRNA or ssDNA b 
oligonucleotide 







T ml 


Ar ml 


T 




P8 


LysNH-TCTCTTTC-CONH 2 


38.1 (<20) 


NA 


58.1 (37.7) 


NA 


Jl-2 


LysNH-TJTCTTTC-CONH 2 


38.1 


0 


52.3 


-5.8 


Jl-4 


LysNH-TCTJTTTC-CONH 2 


28.9 


-9.2 


52.1 


-6.0 


Jl-8 


LysNH-TCTCTTTJ-CONH 2 


25.5 


-12.6 


57.6 


-0.5 


J2-2,4 


LysNH-TJTJTTTC-CONH 2 


33.5 


-4.6 


45.8 


-12.3 


J3 


LysNH-TJTJTTTJ-CONH 2 


29.9 (<20) 


-8.2 


43.9 (37.7) 


-14.2 


Ll-2 


LysNH-TLTCTTTC-C0NH 2 


45.0 


+6.9 


44.0 


-14.1 


Ll-4 


LysNH-TCTLTTTC-CONH 2 


43.7 


+5.6 


52.3 


-5.8 


Ll-8 


LysNH-TCTCTTTL-CONH 2 


36.0 


-2.1 


53.7 


-4.4 


L2-2,4 


LysNH-TLTLTTTC- CONH 2 


55.6 


+ 17.5 


36.3 


-21.8 


clL2-2,4 


H 2 N-TLTLTTTC -Lys -CONH 2 


54.0 


+ 15.9 


38.2 


-19.9 


L3 


LysNH-TLTLTTTL- CONH 2 


64.1 (<20) 


+26.0 


27.8 (<20) 


-30.3 


apL3 


LysNH - LTTTLTLT - CONH 2 


<20 


NA 


29.9 


-28.2 


R8 


5'-UCUCUUUC-3' 


<20 


NA 


33.0 


-25.1 


D8 


5'-TCTCTTTC-3' 


<20 


NA 


25.1 


-33.0 



Buffer condition is 200 mM NaCl, 0.5mM EDTA, 20 mM HEPES. pH 7.5. The concentration of each strand is 5uM. All melting temperatures are 
shown in °C. "All sequences have PNA backbone except that R8 and D8 have ribose-phosphate and deoxyribose-phosphate backbones, respectively. 
b The sequence of the ssRNA and ssDNA is 5'-AGAG AGAGAAAG -3' (Figure 2F and G), with ribose-phosphate and deoxyribose-phosphate 
backbones, respectively. The values shown in parentheses are for the binding to dHP or ssDNA. dHP is homologous to rHPl. PNA L3 shows the 
highest RNA 2 -PNA triplex-forming T ml and lowest RNA-PNA duplex-forming T m . NA, Not applicable. 
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Table SI and Figure S7) were synthesized by solid phase 
peptide synthesis method. A lysine residue was 
incorporated at the N-terminus of the oligomers unless 
otherwise noted. 

L modification enhances RNA 2 -PNA triplex stability with 
reduced pH dependence 

To test whether RNA 2 -PNA triplexes may form at near- 
physiological conditions, we first studied the binding of 
the 8-mer PNAs towards a model RNA hairpin (rHPl) 
(Figure 2A and E) (38) at various pHs and salt concentra- 
tions by UV absorbance detected thermal melting experi- 
ments. With increasing temperature, a PNA typically 
dissociates from rHPl at r ml before the melting of 
rHPl at T m2 (T m i < T m2 ) (Figure 3 A and Supplementary 
Figure S8). 

For PNAs P8, J3 and L3, pH dependent thermal 
melting experiments were carried out at 200 mM NaCl. 
In the pH range studied (5.5-9.0), triplex T ml decreases 
with increasing pH. Triplex T ml for PNA L3 is always 
higher than PNAs J3 and P8 (Table 2 and Figure 3C). 
Remarkably, PNA L3 binds to rHPl even at pH 9.0 
with a T m \ of 47. 6° C. The triplex melting transitions 
start to merge with hairpin rHPl melting at pH 7.0 and 
6.0, respectively, for PNAs L3 and P8. Triplex T mX for 
unmodified PNA P8 shows the highest pH dependence 
(Figure 3C). These results are consistent with the fact 
that the apparent pK a for N 3 in a C nucleobase in a 
TFO of a DNA/RNA triplex is significantly shifted up 
to near neutral (9,53). 



Previous research shows that the pK a decrease toward 
neutral for N 3 in 2-thio U (pK a = 8.8) relative to U 
(pK a = 9.3) monomer (Figure ID) may enhance, respect- 
ively, the Watson-Crick (14,54) and Hoogsteen hydrogen 
bonding (38). Consistently, our quantum mechanics cal- 
culations have revealed that the gas phase local pK a values 
for jV 3 atoms decrease upon thiolation (L versus J, 2-thio 
T versus T and 2-thio C versus C; Supplementary Tables 
S2 and S3). Thus, the pK a decrease toward neutral for 
N 3 in L monomer relative to J monomer (pK a = 9.4) 
(19,35,55) may enhance the Hoogsteen hydrogen 
bonding (Figure IB, C, G, and H) (56). 

The stabilization effect of L modification may also 
result from the improved van der Waals contact between 
a sulfur atom in an L nucleobase in a triplex-forming PNA 
(TFPN A) and an H s atom of a guanine in an RNA duplex 
(Figure 1C and H) (38,57,58). Furthermore, a nucleobase 
L (with a relatively more polarizable and less electronega- 
tive thio group) has enhanced stacking interactions with 
flanking nucleobases and has a reduced dehydration 
penalty (14,38,44,54,57-60). 

L modification reduces the salt dependence for 
RNA 2 -PNA triplex formation 

We investigated salt concentration dependent RNA 2 - 
PNA triplex formation for PNAs P8, J3 and L3 by 
thermal melting at pH 7.5. Upon increasing [NaCl], the 
RNA hairpin is stabilized, whereas the triplex is 
destabilized (Table 3 and Figure 3D). The hairpin is 
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Figure 2. Structures studied in this article. Incorporation of L monomers into PNAs selectively stabilizes an RNA 2 -PNA triplex structure shown 
in panels (e) and (k), with minimal formation of alternative structures, e.g. those shown in panels (f-i). N-term: N-terminus. C-term: C-terminus. 
(a-d) Model RNA hairpins rHPl, rHP2, rHP3 and rHP4. (e) A model RNA 2 -PNA triplex formed between rHPl and a PNA. (f) A parallel RNA- 
PNA duplex, (g) An anti-parallel RNA-PNA duplex, (h and i) Two possible strand invasion complexes rHPl-PNA 2 and rHPl-PNA (52). (j) An 
HIV-1 programmed -1 ribosomal frameshift stimulatory RNA hairpin (HIV-HP). (k) A triplex formed between HIV-HP and PNA L4 (LLTTLL). 
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Figure 3. Thermal melting results for RNA 2 -PNA triplexes and RNA-PNA duplexes, (a) Normalized thermal melting curves for rHPl-PNA 
triplexes at 200mM NaCI, 0.5mM EDTA, 20mM HEPES, pH 7.5. (b) Effects of J and L modifications on thermal stabilities of RNA 2 -PNA 
triplex (J ml ) and RNA-PNA duplex (J m ) at 200 mM NaCI, 0.5 mM EDTA, 20 mM HEPES, pH 7.5 (Table 1). PNA L3 shows the highest triplex- 
forming r m | and lowest duplex-forming T m . (c) Effect of pH on thermal stabilities of rHPl-PNA triplexes (Table 2). (d) Effect of [NaCI] on thermal 
stabilities of rHPl-PNA triplexes (Table 3). 



Table 2. pH dependent thermal melting results for rHPl-PNA 
triplexes and rHPl alone 



pH 


rHPl 


-P8 


rHPl 


-.13 


rHPl 


L3 


rHPl 


7ml 




7ml 




7ml 




5.5 


(84.6) 


84.6 


NC 


83.1 


(82.9) 


82.9 


82.4 


6.0 


(84.5) 


84.5 


NC 


84.1 


(84.0) 


84.0 


83.1 


6.5 


62.6 


83.8 


36.1 


84.6 


(83.8) 


83.8 


84.2 


7.0 


51.9 


84.6 


32.6 


84.6 


(84.0) 


84.0 


83.5 


7.? 


38.1 


83.3 


29.9 


84.8 


64.1 


83.9 


84.3 


8.0 


26.9 


84.6 


29.2 


83.8 


55.7 


83.9 


83.6 


8.5 


<20.0 


83.9 


NC 


84.1 


52.7 


84.6 


83.6 


9.0 


<20.0 


82.7 


29.2 


82.6 


47.6 


83.3 


82.5 



Buffers contain 200 mM NaCI, 0.5 mM EDTA, 20 mM MES (pH 5.5 
and 6.0) or HEPES (pH 6.5-9.0). All melting temperatures are shown 
in °C Values shown in parentheses are for melting traces with T ml 
merging with T m2 . NC, the melting is not clear due to a broad 
transition. 



Table 3. [NaCI] dependent thermal melting results for rHPl-PNA 
triplexes and rHPl alone 



[NaCI] 
(mM) 


rHPl 


-PS 


rHPl 


-.13 


rHPl 


-L3 


rHPl 

T m 2 


7ml 


T m 2 


7ml 


T m 2 


7"ml 


7"m2 


10 


(76.2) 


76.2 


(74.5) 


74.5 


(75.9) 


75.9 


74 .1 


100 


46.4 


81.6 


37.9 


82.2 


(82.2) 


82.2 


81.5 


200 


38.1 


84.6 


29.9 


83.5 


64.1 


85.4 


84.3 


300 


26.7 


87.4 


26.1 


85.2 


56.2 


87.4 


87.1 


500 


<20 


88.9 


<20 


89.6 


51.8 


89.3 


88.6 


1000 


<20 


>90 


<20 


>90 


40.1 


>90 


>90 



All buffers contain 0.5 mM EDTA, 20 mM HEPES, pH 7.5. All melting 
temperatures are shown in °C. Values shown in parentheses are for 
melting traces with T ml merging with T m2 . 
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stabilized with increasing [NaCl] because Na is 
condensed upon hairpin stem formation. 

Triplex structures with one or more C + • G-C base triples 
and with all three strands comprising negatively charged 
DNA and/or RNA are destabilized upon increasing [NaCl] 
(25,38,61). This is likely due to the favorable charge- 
charge attraction between C + and phosphate backbone 
decreases with increasing salt concentration, resulting in 
reduced triplex thermal stability. Similarly, PNA P8 
forms less stable RNA 2 -PNA triplex with higher [NaCl], 
mainly due to the formation of three positively charged 
C +- G-C base triples, which have favourable charge- 
charge attraction with negatively charged RNA 
backbone. In addition, all PNAs have one positively 
charged lysine residue (Table 1), which also has charge- 
charge attractions with RNA backbone. Furthermore, 
increasing [NaCl] may decrease the pK a further away 
from neutral for N s in C residues (pK a = 4.5 for C 
monomer) in PNA P8 (25,62). Thus, PNA binding to nega- 
tively charged RNA duplex is weakened upon increasing 
[NaCl]. PNA L3 and J3 show less pronounced [NaCl] de- 
pendent T m i for triplex formation than PNA P8 (Table 3 
and Figure 3D), presumably due to the absence of charged 
nucleobases in the RNA 2 -PNA triplexes containing 
L3 and J3. Taken together, RNA 2 -PNA triplex is signifi- 
cantly stabilized upon L modification in TFPNAs at near- 
physiological conditions with relatively small pH and salt 
concentration dependence. 

Effects of number and position of modifications on 
RNA 2 -PNA triplex formation 

At 200 mM NaCl, pH 7.5, substitutions of a single C with an 
L monomer near to the N-terminus (Ll-2) and middle (Ll- 
4) positions of a PNA increase the T mX by 6.9 and 5.6°C, 
respectively (Table 1 and Figure 3). For bi-modified (L2- 
2,4) and fully modified (L3) PNAs, T m \ values increase by 
17.5 and 26.0°C, respectively. Triplex stability is relatively 
independent of the position of lysine (at N- or C-terminus) 
in a PNA (see L2-2,4 and clL2-2,4 in Table 1). The 8-mer 
oligonucleotides R8 and D8 do not form a stable triplex 
with rHPl (Table 1), which shows that, compared to nega- 
tively charged RNA and DNA, PNAs have significantly 
enhanced affinity towards RNA duplex regions. 

Surprisingly, at 200 mM NaCl, pH 7.5, C-terminus modi- 
fication (Ll-8) decreases T m \ slightly (by 2.1°C) (Table 1 
and Figure 3). This observation may be explained by the 
fact that the terminal base triple of an RNA 2 -PNA triplex 
has a relatively more hydrophilic local environment than 
those of internal base triples. Thus, at the terminus of an 
RNA 2 -PNA triplex, with only one stacking partner, a 
neutral C • G-C base triple or a positively charged C + • G- 
C base triple is more favourable than a neutral and more 
hydrophobic L-G-C base triple. The apparent pK^ of a 
terminal C nucleobase (apparent pK a is ~6-7) in a TFO 
of a DNA triplex is significantly lower than that of an 
internal one (with apparent pK a > 8) (53). Thus, we expect 
the destabilization effect of a terminal L monomer to be 
more significant at low pH. The rHPl-L3 triplex, 
however, is more stable than rHPl-(L2-2,4) triplex with 
an additional C-terminal L-G-C base triple. It is 



probably due to non-nearest neighbor (allosteric) effect 
caused by the two pre-existing L*G-C base triples in the 
rHPl-(L2-2,4) triplex. The advantage of a neutral 
nucleobase L is that the triplex formation has low pH and 
salt dependence, as discussed above. In addition, the 
potential destabilizing effect due to charge-charge repul- 
sion between adjacent positively charged C +, G-C base 
triples (25,58) is not expected for neutral L-G-C base 
triples. 

At pH 8.0 or above, PNA J3 forms a more stable RNA 2 - 
PNA triplex than unmodified PNA P8 (Figure 3 
and Tables 1-3), which is consistent with previous 
DNA-PNA 2 and DNA 2 TFO triplex studies (28,35). 
A destabilization effect of J modification, however, 
is observed at pH < 8, probably due to less favorable 
stacking interactions compared to C or C + . The destabiliz- 
ing effect of J modification was observed at both internal 
and terminal positions. Thus, thiolation of J (to make L) is 
critical in stabilizing RNA 2 -PNA triplexes. 

We further studied RNA 2 -PNA triplex formation by 
non-denaturing PAGE. The fast- and slow-moving bands 
(Supplementary Figure S9, Figure 4) correspond to the 
hairpin and triplex, respectively. PNAs form parallel 
triplexes with the complementary rHPl, whereas control 
oligonucleotides R8 and D8 do not bind to rHPl 
(Supplementary Figure S9, top panel), consistent with our 
thermal melting results. PNAs do not form triplexes with 
an RNA hairpin, rHP2 (Figure 2B), with one G-C pair 
inverted compared to rHPl (Supplementary Figure S9, 
top panel). 

We tested the binding of an anti-parallel PNA apL3 
(Table 1) to rHPl. Our thermal melting results show no 
triplex melting transitions (at >20°C) for apL3 binding to 
rHPl at pH 5.5-7.5 (Supplementary Figure S8C). 
However, at a relatively low temperature (4°C), two 
slow-moving bands are observed in non-denaturing 
PAGE (Supplementary Figure S9, bottom panel). Thus, 
PNA apL3 may form two different complexes with rHPl. 
We speculate that a 5-base-triple parallel triplex and two 
coaxially stacked 5-base-triple parallel triplexes may form 
between rHPl and apL3 (Supplementary Figure S9, 
bottom panel). Interestingly, non-denaturing PAGE 
suggests that apL3 does not bind to rHP2, probably 
because one base triple and the coaxial stacking are 
disrupted (Supplementary Figure S9, bottom panel). 
Thus, coaxial stacking interactions may be utilized to 
enhance the formation of multiple RNA 2 -PNA triplexes 
on one RNA duplex. 

L modification destabilizes RNA-PNA and DNA-PNA 
duplexes 

The relative stabilities of RNA RNA (Figure 2A-D) and 
RNA-PNA (Figure 2F and G) duplexes determine the 
likelihood of strand invasion (Figure 2H and I). 
Modifications that stabilize Hoogsteen base pairs but 
destabilize Watson-Crick base pairs (Figure 1) enhance 
RNA 2 -PNA triplex formation, but minimize RNA- 
PNA duplex formation. We thus carried out thermal 
melting studies at 200 mM NaCl, pH 7.5 for PNAs 
binding to a ssRNA (Table 1) to form parallel and anti- 
parallel RNA-PNA duplexes (Figure 2F and G). 
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Figure 4. Non-denaturing PAGE (12%) with a running buffer of lx TBE, pH 8.3 for 6h at 250V. The incubation buffer is 200 mM NaCl, 0.5 mM 
EDTA, 20 mM HEPES, pH 7.5. The loaded hairpins are at 1 uM in 20 ul. PNA concentrations in lanes from left to right are 0, 0.2, 0.4. 1, 1.6, 2, 4, 
10, 16, 20, 28, 50 and 80 uM, respectively, (a-c) rHPl binds to PNAs P8, J3 and L3 with K d values of (5.3 ± 1.3), (7.0 ± 1.9) and (1.7 ± 0.6) uM, 
respectively, (d-f) rHP2 does not bind to PNAs P8, J3 or L3. (g-i) rHP3 shows no binding to P8 and weak binding to J3, and binds to L3 with K d 
value of (12.2 ± 3.8) uM. (j-1) rHP4 shows no binding to P8, and weak binding to J3 and L3. (m-o) dHP is homologous to rHPl, and shows no 
binding to P8 or J3, and weak binding to L3. (p-r) Binding studies of PNAs P6 (CCTTCC), J4 (JJTTJJ) and L4 (LLTTLL) to HIV-HP. Only PNA 
L4 binds to HIV-HP, with a K d value of (1.1 ± 0.3) uM. 



A gradual decrease in RNA-PNA duplex T m was 
observed with an increasing number of J and L modifica- 
tions (Table 1 and Figure 3B). Remarkably, PNA L3 
shows lowest RNA-PNA duplex T m (27.8°C) [but 
highest RNA 2 PNA triplex T ml (64.1°C)]. In addition, 
T m values are 37.7, 37.7 and <20°C, respectively, for 
PNAs P8, J3 and L3 binding to a homologous ssDNA 
(Table 1, Supplementary Figure S8Y and Z). 

The destabilization effect of L modification on Watson- 
Crick-like RNA-PNA and DNA-PNA duplex formation 
is presumably due to the steric clash between the sulfur 
atom of L and the amino group of G in a Watson-Crick- 
like G-L base pair and loss of two hydrogen bonds 
(Figure IF and I). Thus, incorporation of L monomers 
facilitates tight binding of PNAs to RNA duplexes, but 
not ssRNAs or ssDNAs. 

RNA 2 -PNA and DNA 2 -PNA triplex stability and strand 
invasion property measured by non-denaturing PAGE 

Strand invasion is favored at relatively low salt concentra- 
tion (29-35,52,63). Thus, we tested the possibility of 
strand invasion of PNAs at 10 mM NaCl, pH 5.5. As 



indicated by the non-denaturing PAGE results, PNA P8 
shows strand invasion with rHPl, but PNA L3 forms 
RNA 2 -PNA triplexes without strand invasion 
(Supplementary Figure S10). Both triplex and hairpin 
rHPl band intensities decrease with increasing concentra- 
tion of PNA J3 (Supplementary Figure S10), indicating 
that aggregation may occur at this condition. As 
expected, PNAs P8 and J3 show strand invasion with 
a homologous DNA hairpin (dHP) of rHPl 
(Supplementary Figures Sll and SI 2) at lOmM NaCl, 
pH 5.5. Under the same conditions PNA L3 is still best 
for DNA 2 -PNA triplex formation without strand 
invasion. Thus, L modification minimizes strand 
invasion and stabilizes the formation of both RNA 2 - 
PNA and DNA 2 -PNA triplexes. Interestingly, when 
there is no appreciable strand invasion at 10 mM NaCl, 
pH 5.5, PNAs J3 and L3 bind more tightly to rHPl than 
dHP (Supplementary Figures S10 and SI 2), consistent 
with previously reported results (41,43). It is likely that a 
relatively deep and narrow RNA duplex major groove 
provides geometry compatibility and favorable back- 
bone-backbone interactions with PNA. 
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Table 4. K d (uM) values for PNA binding to RNA or DNA 
duplexes obtained by non-denaturing PAGE 



P8 J3 L3 



rHPl 5.3 ± 1.3 (>20) 7.0 ±1.9 (12.4 ± 4.2) 1.7 ± 0.6 (3.8 ± 1.9) 

rHP2 NB NB NB 

rHP3 NB >50 12.2 ± 3.8 

rHP4 NB >50 >50 

dHP NB NB >50 



The incubation buffer contains 200 mM NaCl, 0.5 mM EDTA, 20 mM 
HEPES, pH 7.5. Values shown in parentheses are for K d 's measured at 
pH 8.0. NB, no binding. 



At 200 mM NaCl, PNAs P8, J3 and L3 do not show 
strand invasion of rHPl with the PNA concentration up 
to ~50uM (Supplementary Figure S10A-C). Thus, we 
quantified the binding affinity by non-denaturing PAGE 
at 200 mM NaCl, pH 7.5 and 8.0. At pH 7.5, the dissoci- 
ation constant (K d ) values are (5.3 ± 1.3), (7.0 ± 1.9) and 
(1.7 ± 0.6) uM for rHPl binding to PNAs P8, J3 and L3, 
respectively (Table 4, Figure 4A-C, Supplementary Figure 
S13). At 200 mM NaCl, pH 8.0, the K d values are >20, 
(12.4 ± 4.2) and (3.8 ± 1.9) uM, respectively, for PNAs 
P8, J3 and L3 binding to rHPl (Table 4, Supplementary 
Figures S14 and S15). The order of the binding affinities is 
consistent with our thermal melting results at both pH 7.5 
and 8.0 (Figure 3 and Tables 1 and 2). 

Non-denaturing PAGE results show weak binding for 
PNA L3, and no binding for PNAs P8 and J3 to the DNA 
hairpin dHP at 200 mM NaCl, pH 7.5 (Figure 4M-0), 
which are consistent with the DNA-binding studies at 
lOmM NaCl, pH 5.5 (as discussed above) and previous 
studies (41,43). We further quantified the sequence speci- 
ficity of RNA duplex recognition by PNA at 200 mM 
NaCl, pH 7.5. Upon changing a G-C pair in rHPl to 
C-G (rHP2), A-U (rHP3) or U-A (rHP4) (Figure 2B- 
D), we observed no appreciable binding for most of the 
PNAs (Figure 4D-L, Table 4). Surprisingly, PNA L3 
binds to rHP3 [K d = (12.2 ± 3.8) uM] at 200mM NaCl, 
pH 7.5 (Table 4, Figure 41, Supplementary Figure S13). 
It is probably because rHP3 has five consecutive A-U 
pairs, resulting in increased RNA-duplex flexibility and 
thus reduced sequence specificity for PNA binding. 
Further studies are needed to better understand the 
sequence specificity at varied sequence contexts. Taken 
together, our thermal melting and gel results suggest 
that L modified PNAs bind tightly and sequence-specific- 
ally to RNA-duplex regions, but not ssRNAs, ssDNAs or 
double-stranded DNAs. 

Binding of L-modified PNA to an HIV-1 programmed —1 
ribosomal frameshift stimulatory RNA hairpin 

We next studied the binding of 6-mer PNAs (Figure 4P-R, 
Supplementary Table SI) to an HIV-1 programmed — 1 
ribosomal frameshift stimulatory RNA hairpin (HIV- 
HP) (Figure 2J and K). In the secondary structure of 
HIV-1 RNA genome (64), the single-stranded 5'-GGAA 
GG-3' sequence occurs frequently, however the duplex 
sequence 5'-GGAAGG-3'/3'-CCUUCC-5' is not found 



outside of the ribosomal frameshift site. Remarkably, 
non-denaturing PAGE results (Figure 4R and 
Supplementary Figures S13-S15) reveal that the 6-mer L- 
modified PNA L4 (LLTTLL) binds to HIV-HP at 200 mM 
NaCl, pH 7.5 and 8.0, with K d values of (1.1 ± 0.3) and 
(2.5 ± 0.6) uM, respectively. PNAs P6 and J4, however, 
show no binding at 200 mM NaCl, at pH 7.0 and 7.5 
(Figure 4P and Q, and Supplementary Figure SI 6). 

CONCLUSION 

In summary, we have developed a method for the synthe- 
sis of a novel PNA monomer thio-pseudoisocytosine (L). 
L-incorporated PNAs show superior affinity and specifi- 
city in recognizing RNA-duplex regions to form RNA 2 
PNA triplexes with minimal formation of RNA-PNA 
duplexes or RNA-PNA 2 triplexes, at near-physiological 
conditions. In addition, L-modified short PNAs show no 
appreciable binding to ssDNA or double-stranded DNA. 
Triplex formation without strand invasion presumably has 
much faster kinetics than complex formation with strand 
invasion (52,63). The promising properties of L-modified 
PNAs suggest that carefully designed L-modified short 
PNAs can be used to specifically target double-stranded 
RNA structures and may thus be useful for mapping 
complex RNA secondary structures (by identifying 
double helices), probing RNA tertiary and RNA-protein 
interactions involving RNA duplex regions (by mapping 
double-stranded regions occluded by tertiary contacts or 
proteins), and specifically stabilizing desired RNA-duplex 
regions, which may prove useful in the application of 
modified PNAs as an RNA-targeting therapeutic (e.g. by 
stabilizing the HIV-1 frameshift hairpin). 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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